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Abstract. We investigate the power of spectral synthesis as a mean to es- 
timate physical properties of galaxies. Spectral synthesis is nothing more than 
the decomposition of an observed spectrum in terms of a superposition of a base 
of simple stellar populations of various ages and metallicities (here from Bruzual 
& Chariot 2003), producing as output the star- formation and chemical histories 

■ of a galaxy, its extinction and velocity dispersion. We discuss the reliability of 
| this approach and apply it to a volume limited sample of 50362 galaxies from 

■^j- , the SDSS Data Release 2, producing a catalog of stellar population properties. 

■ Emission lines are also studied, their measurement being performed after sub- 
tracting the computed starlight spectrum from the observed one. A comparison 

, with recent estimates of both observed and physical properties of these galaxies 

■ obtained by other groups shows good qualitative and quantitative agreement, 
\ despite substantial differences in the method of analysis. The confidence in the 

method is further strengthened by several empirical and astrophysically rea- 
sonable correlations between synthesis results and independent quantities. For 
instance, we report the existence of strong correlations between stellar and neb- 
+3 , ular metallicites, stellar and nebular extinctions, mean stellar age and equivalent 

£2 ' width of Ha and 4000 A break, and between stellar mass and velocity dispersion. 

\ We also present preliminary results of an analysis of a magnitude-limited sample 

^ . which clearly reveals that the bimodality of galaxy populations is present in the 

J^j ' parameters computed in the synthesis. Our results are also consistent with the 

rS | "down-sizing" scenario of galaxy formation and evolution. Finally, we point out 

. one of the major problems facing spectral synthesis of early-type systems: the 

spectral base adopted here is based on solar-scaled evolutionary tracks whose 
abundance pattern may not be appropriate for this type of galaxy. 



1. Introduction 

In the proceedings of the meeting on Stellar Populations held in Baltimore in 
1986 Searle (1986), discussing integral light spectral synthesis, states that "this 
subject has a bad reputation. Too much has been claimed, and too few have 
been persuaded". Indeed, recovering the stellar content of a galaxy from its 
observed integrated spectrum is not an easy task, as can be deduced from the 
amount of work devoted to this topic over the past half century. The situation 
is however much more favorable nowadays. Huge observational and theoretical 
efforts in the past few years have produced large sets of high quality spectra 
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of stars (e.g., Prugniel & Soubiran 2001; Le Borgne etal. 2003; Bertone et al. 
2004; Gonzalez-Delgado etal. 2005). These libraries are being implemented in 
a new generation of evolutionary synthesis models, allowing the prediction of 
galaxy spectra with an unprecedented level of detail (Vazdekis 1999; Bruzual 
& Chariot 2003, hereafter BC03; Le Borgne etal. 2004). At the same time, 
galaxy spectra are now more abundant than ever. The Sloan Digital Sky Survey 
(SDSS), in particular, is providing a homogeneous data base of hundreds of 
thousands of galaxy spectra in the 3800-9200 A range, with a resolution of 
A/AA - 1800 (York etal. 2000; Stoughton etal. 2002; Abazajian etal. 2003, 
2004). This enormous amount of high quality data will undoubtedly be at 
the heart of tremendous progress in our understanding of galaxy constitution, 
formation and evolution. Actually, galaxy spectra encode information on the age 
and metallicity distributions of the constituent stars, which in turn reflect the 
star-formation and chemical histories of the galaxies. Retrieving this information 
from observational data in a reliable way is crucial for a deeper understanding 
of galaxy formation and evolution and, in fact, significant steps in this direction 
have recently been made (Kauffmann et al. 2003a, hereafter K03; Brinchmann 
et al. 2004; Tremonti et al. 2004; Heavens et al. 2004; Panter, Heavens &: Jimenez 
2004). 

Here we demonstrate that, besides providing excellent starlight templates 
to aid emission line studies, spectral synthesis recovers reliable stellar population 
properties out of galaxy spectra of realistic quality. We show that this simple 
method provides robust information on the stellar age (t*) and stellar metal- 
licities (Z+) distributions, as well as on the extinction, velocity dispersion and 
stellar mass. The ability to recover information on Z± is particularly welcome, 
given that stellar metallicities are notoriously more difficult to assess than other 
properties. In order to reach this goal we follow: (1) a priori arguments, based 
on simulations; (2) comparisons with independent work based on a different 
method, and (3) an a posteriori empirical analysis of the consistency of results 
obtained for a large sample of SDSS galaxies. Most of the results presented here 
are discussed in detail in Cid Fernandes et al. (2005). 

In Section 2 we present an overview of our synthesis method and simulations 
designed to test it and evaluate the uncertainties involved. The discussion is 
focused on how to use the synthesis to derive robust estimators of physically 
interesting stellar population properties. Section 3 defines a volume limited 
sample of SDSS galaxies and presents the results of the synthesis of their spectra, 
along with measurements of emission lines. We also compare in this section our 
results to those obtained by other authors. Stellar population and emission line 
properties are used in Section 4 to investigate whether the synthesis produces 
astrophysically plausible results. In Section 5 we present preliminary results of 
an analysis of a magnitude-limited sample of 20000 galaxies also extracted from 
SDSS. Finally, Section 6 summarizes our main findings. 

2. Spectral Synthesis 
2.1. Method 

Our synthesis code, which we call STARLIGHT, was first discussed in Cid Fer- 
nandes et al. (2004, hereafter CF04). We fit an observed spectrum 0\ with 
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a combination of iV* Simple Stellar Populations (SSP) from the evolutionary 
synthesis models of BC03. Extinction is modeled as due to foreground dust, 
and parametrized by the V-band extinction Ay. The Galactic extinction law 
of Cardelli, Clayton & Mathis (1989) with Ry = 3.1 is adopted. Line of sight 
stellar motions are modeled by a Gaussian distribution G centered at velocity 
v* and with dispersion cr*. With these assumptions the model spectrum is given 
by 



M x = M\ 



3=1 



G(v*,a*) (1) 



where bj \ is the spectrum of the j th SSP normalized at Ao, r\ = l0~ 0A ( Ax ~ Ax o} 
is the reddening term, M\ is the synthetic flux at the normalization wavelength, 
x is the population vector and <8> denotes the convolution operator. Each com- 
ponent Xj (j = 1 . . . N±) represents the fractional contribution of the SSP with 
age tj and metallicity 1 Zj to the model flux at Ao- The base components can 
be equivalently expressed as a mass fractions vector jl. In this work we adopt a 
base with iV* = 45 SSPs, encompassing 15 ages between 10 6 and 1.3 x 10 10 yr 
and 3 metallicities: Z = 0.2, 1 and 2.5 Zq. Their spectra were computed with 
the STELIB library (Le Borgne etal. 2003), Padova 1994 tracks, and Chabrier 
(2003) IMF (see BC03 for details). 

The fit is carried out with a simulated annealing plus Metropolis scheme 
which searches for the minimum x 2 = J2\ [{Ox — M\)w\] 2 , where w^ 1 is the 
error in 0\. Regions around emission lines, bad pixels or sky residuals are 
masked out by setting w\ = 0. Pixels which deviate by more than 3 times the 
rms between 0\ and an initial estimate of M\ are also given zero weight. 

Fig. 1 illustrates the spectral fit obtained for a galaxy drawn from the SDSS 
database. The top-left panel shows the observed spectrum (thin line) and the 
model (thick), as well as the error spectrum (dashed). The bottom- left panel 
shows the 0\ — M\ residual spectrum, while the panels in the right summarize 
the derived star-formation history encoded in the age-binned population vector. 
This example, along with those in K03 and CF04, demonstrates that this simple 
method is capable of reproducing real galaxy spectra to an excellent degree of 
accuracy. 

An important application of the synthesis is to measure emission lines from 
the residual spectrum, as done by K03. Another, of course, is to infer stellar 
population properties from the fit parameters. Our central goal here is to in- 
vestigate whether spectral synthesis can also recover reliable stellar population 
properties. In the remainder of this section we address this issue by means of 
simulations. 

2.2. Robust description of the synthesis results 

The existence of multiple solutions is an old known problem in stellar population 
synthesis, and even superb spectral fits as that shown in Fig. 1 do not guarantee 



1 In this paper we follow the convention used in stellar evolution studies, which define stellar 
metallicities in terms of the fraction of mass in metals. In this system the Sun has = 0.02. 
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Figure 1. Spectral synthesis of a late type SDSS galaxy. Top-left: Observed 
(thin line), model (thick line) and error spectra (dashed line). Points along 
a horizontal line in the bottom of this panel indicate bad pixels (as given 
by the SDSS flag) or emission lines windows, both of which were masked 
out in the fits. Bottom-left: Residual spectrum. Masked regions are plotted 
with a lighter thin line. Right: Flux (top) and mass (bottom) fractions as a 
function of age. Some of the derived properties are listed in the top-right. x\ 
is the reduced % 2 , and A a is the mean relative difference between model and 
observed spectra; S/N refers to the region around Aq = 4020 A. 



that the resulting parameter estimates are trustworthy. There is, consequently, 
a need to assess the degree to which one can trust the parameters involved in 
the fit before using them to infer stellar populations properties. The spectral fits 
involve + 3 parameters: iV* — 1 of the x components (one degree of freedom 
is removed by the normalization constraint), M\ , Ay and the two kinematical 
parameters, and <r*. The reliability of parameter estimation is best studied 
by means of simulations which feed the code with spectra generated with known 
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parameters, add noise, and then examine the correspondence between input and 
output values. 

Simulations We have carried out simulations designed to test the method and 
investigate which combinations of the parameters provide robust results. Sev- 
eral sets of simulations were performed. Given our interest in modeling SDSS 
galaxies, here we focus on simulations tailored to match the characteristics of 
this data set. Test galaxies were built from the average x, Ay and a+ within 
65 boxes in the mean stellar age versus mean stellar metallicity plane obtained 
for the sample described in Section 3. These new simulations confirm previous 
results reported in CF04 that the individual components of x are very uncertain, 
so we skip a detailed comparison between Xi npu t and x ou t pu t and jump straight 
to results based on more robust descriptions of the synthesis output. 

Condensed population vector A coarse but robust description of the star-for- 
mation history of a galaxy may be obtained by binning x onto "Young" (tj < 10 8 
yr), "Intermediate-age" (10 8 < tj < 10 9 yr), and "Old" (tj > 10 9 yr) components 
(xy, xi and xo respectively). These age-ranges were defined on the basis of 
the simulations, by seeking which combinations of Xj's produce smaller input 
— output residuals. Our simulations show that these 3 components are very 
well recovered by the method, with uncertainties smaller than Axy = 0.05, 
Ax/ = 0.1, and Ax = 0.1 for S/N > 10. 

Mass, extinction and velocity dispersion We have also analyzed the input ver- 
sus output values of Ay, cr* and the stellar mass M*. The latter is not an explicit 
input parameter of the models, but may be computed from \x and the M*/L\ 
ratio of the different populations in the base. The uncertainties in the recovery 
of these parameters are AAy < 0.05 mag, AlogM* < 0.1 dex and Act* < 12 
kms" 1 for S/N > 10. 

Mean stellar age If one had to choose a single parameter to characterize the 
stellar population mixture of a galaxy, the option would certainly be its mean 
age. We define two versions of mean stellar age (the logarithm of the age, 
actually), one weighted by light 

AT* 

< log U >L= X 3 lo S t 3 ( 2 ) 

and another weighted by stellar mass 

AT* 

< log U >m= H lo S t 3 ( 3 ) 

j=l 

Note that, by construction, both definitions are limited to the 1 Myr-13 Gyr 
range spanned by the base. The mass weighted mean age is in principle more 
physical, but, because of the non-constant M/L of stars, it has a much less direct 
relation with the observed spectrum than < log t* >l- 

The simulations show that the mean age is a very robust quantity. The 
rms difference between input and output < logt* >l values is < 0.08 dex for 
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S/N > 10, and < 0.14 dex for < logi* >m- Although the uncertainties of 

< logt* >l and < logi* >m are comparable in absolute terms, the latter 
index spans a smaller dynamical range (because of the large M/L ratio of old 
populations), so in practice < logi* >l is the more useful of the two indices. 

Mean stellar metallicity Given an option of what to choose as a second pa- 
rameter to describe a mixed stellar population, the choice would likely be its 
typical metallicity. Analogously to what we did for ages, we define both light 
and mass-weighted mean stellar metallicities: 

<Z*> L =Y J x i z i ( 4 ) 
i=i 

and 

< Z±> M ='^2(j, j Z j (5) 
i=i 

both of which are bounded by the 0.2-2.5 Z & base limits. Our simulations show 
that the rms of A log < Z+ > M = log < >M,out P ut -log < Z± >M,input is 
of order 0.1 dex. In absolute terms this is comparable to A < logi* >, but 
note that < Z± > covers a much narrower dynamical range than < logi* >, so 
that in practice mean stellar metallicities are more sensitive to errors than mean 
ages. This is not surprising, given that age is the main driver of variance among 
SSP spectra, metallicity having a "second-order" effect (e.g., Schmidt etal. 1991; 
Ronen, Aragon-Salamanca & Lahav 1999). This is the reason why studies of the 
stellar populations of galaxies have a much harder time estimating metallicities 
than ages, to the point that one is often forced to bin-over the Z information 
and deal only with age-related estimates such as < logi* > (e.g., Cid Fernandes 
etal. 2001; Cid Fernandes, Leao & Rodrigues Lacerda 2003; K03). 

Notwithstanding these notes, it is clear that uncertainties of ~ 0.1 dex 
in < > are actually good news, since they do allow us to recover useful 
information on an important but hard to measure property. This new tracer of 
stellar metallicity is best applicable to large samples of galaxies such as the SDSS. 
The statistics of samples help reducing uncertainties associated with < Z* > 
estimates for single objects and allows one to investigate correlations between 

< Z+ > and other galaxy properties (Sodre etal. , in preparation; Section H.l.JI . 

Age-Metallicity degeneracy Our method tends to underestimate < Z+ > for 
metal-rich systems and vice-versa. This trend is due to the infamous age- 
metallicity degeneracy. In order to verify to which degree our synthesis is af- 
fected by this well known problem (e.g., Renzini & Buzzoni 1986; Worthey 1994; 
Bressan, Chiosi & Tantalo 1996) we have examined the correlation between the 
output minus input residuals in < log£* >l and log < Z+ >£. The age-Z 
degeneracy acts in the sense of confusing old, metal-poor systems with young, 
metal-rich ones and vice-versa, which should produce anti-correlated residuals. 
This anti-correlation is also present with the mass-weighted mean stellar metal- 
licity < Z* >Mi but is not as strong as for < Z+ >£. On the other hand, the 
uncertainty in < Z+ >u is always larger than for < >i. 
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The age-Z degeneracy is thus present in our method, introducing correlated 
residuals in our < Z* > and < logi* > estimates at the level of up to ~ 0.1-0.2 
dex. However, none of the results reported here rely on this level of precision. 

Base limitations We have carried out simulations using the Z = O.O2Z BC03 
SSPs to examine some of the limitations of the base addopted here. Galaxies 
with such low metallicity are not expected to be present in significant numbers 
in the sample described in Section 13.1.1 given that it excludes low luminosity 
systems like HII galaxies and dwarf ellipticals (which are also the least metal- 
lic ones by virtue of the mass-metallicity relation). Still, it is interesting to 
investigate what would happen in this case. When synthesized with our 0.2— 
2.5Zq base, these extremely metal poor galaxies are modeled predominantly 
with the 0.2Zq components, as intuitively expected. Moreover, the mismatch 
in metallicity introduces non-negligible biases in other properties, like masses, 
mean ages and extinction (M*, for instance, is systematically underestimated 
by 0.3 dex). Similar problems should be encountered when modeling systems 
with Z > 2.5Zq. These results serve as a reminder that our base spans a wide 
but finite range in stellar metallicity, and that extrapolating these limits has an 
impact on the derived physical properties. While there is no straightforward a 
priori diagnostic of which galaxies violate these limits, in general, one should be 
suspicious of objects with mean Z+ too close to the base limits. 

Our simulations demonstrate that we are able of producing reliable esti- 
mates of several parameters of astrophysical interest, at least in principle. We 
must nevertheless emphasize that this conclusion relies entirely on models and 
on an admittedly simplistic view of galaxies. When applying the synthesis to 
real galaxy spectra, a series of other effects come into play. For instance, the 
extinction law appropriate for each galaxy likely differs from the one used here. 
Similarly, while in the evolutionary tracks adopted here the metal abundances 
are scaled from the solar values, non-solar abundance mixtures are known to 
occur in stellar systems (e.g., Trager etal. 2000a,b; see also Section 5), not to 
mention uncertainties in the SSP models and the always present issue of the 
IMF. Accounting for all these effects in a consistent way is not currently feasi- 
ble. We mention these caveats not to dismiss simple models, but to highlight 
that all parameter uncertainties discussed above are applicable within the scope 
of the model. 

Hence, while the simulations lend confidence to the synthesis method, one 
might remain skeptical of its actual power. The next sections further address 
the reliability of the synthesis, this time from a more empirical perspective. 

3. Analysis of a volume-limited galaxy sample 

In this section we apply our synthesis method to a large sample of SDSS galaxies 
to estimate their stellar population properties. We also present measurements 
of emission line properties, obtained from the observed minus synthetic spectra. 
The information provided by the synthesis of so many galaxies allows one to 
address a long menu of astrophysical issues related to galaxy formation and 
evolution. Before venturing in the exploration of such issues, however, it is 
important to validate the results of the synthesis by as many means as possible. 
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Hence, the goal of this study is not so much to explore the physics of galaxies 
but to provide an empirical test of our synthesis method. 

3.1. Sample definition 

The spectroscopic data used in this work were taken from the SDSS. This survey 
provides spectra of objects in a large wavelength range (3800-9200 A) with mean 
spectral resolution A/AA ~ 1800, taken with 3 arcsec diameter fibers. The most 
relevant characteristic of this survey for our study is the enormous amount of 
good quality, homogeneously obtained spectra. The data analyzed here were 
extracted from the SDSS main galaxy sample available in the Data Release 
2 (DR2; Abazajian etal. 2004). This flux-limited sample consists of galaxies 
with reddening-corrected Petrosian r-band magnitudes r < 17.77, and Petrosian 
r-band half-light surface brightnesses ^50 < 24.5 mag arcsec" 2 (Strauss et al. 
2002). 

From the main sample, we first selected spectra with a redshift confidence 
> 0.35. Following the conclusions of Zaritsky, Zabludoff, & Willick (1995), we 
have imposed a redshift limit of z > 0.05 (trying to avoid aperture effects and 
biases; see e.g. Gomez et al. 2003) and selected a volume limited sample up to 
z = 0.1, corresponding to a r-band absolute magnitude limit of M(r) = —20.5. 
The absolute magnitudes used here are k-corrected with the help of the code 
provided by Blanton et al. (2003; kcorrect v3_2) and assuming the following 
cosmological parameters: Hq = 70 km s _1 Mpc -1 , O^f = 0.3 and = 0.7. 
We also restricted our sample to objects for which the observed spectra show a 
S/N ratio in g, r and i bands greater than 5. These restrictions leave us with a 
volume limited sample containing 50362 galaxies, which leads to a completeness 
level of ~ 98.5 per cent. 

3.2. Results of the spectral synthesis 

All 50362 spectra were brought to the rest-frame (using the redshifts in the SDSS 
database), sampled from 3650 to 8000 A in steps of 1 A, corrected for Galactic 
extinction 2 using the maps given by Schlegel, Finkbeiner & Davis (1998) and the 
extinction law of Cardelli etal. (1989, with Ry = 3.1), and normalized by the 
median flux in the 4010-4060 A region. The S/N ratio in this spectral window 
spans the 5-30 range, with median value of 14. Besides the masks around 
the lines listed in Section 12.2.1 we exclude points with SDSS flag > 2, which 
signals bad pixels, sky residuals and other artifacts. After this pre-processing, 
the spectra are fed into the STARLIGHT code described in Section 12.1.1 On 
average, the synthesis is performed with N\ = 3677 points, after discounting the 
ones which are clipped by our < 3 sigma threshold (typically 40 points) and the 
masked ones. The spectral fits are generally very good. 

The total stellar masses of the galaxies were obtained from the stellar masses 
derived from the spectral synthesis (which correspond to the light entering the 
fibers) by dividing them by (1 — /), where / is the fraction of the total galaxy 
luminosity in the z-band outside the fiber. This approach, which neglects stellar 



2 Unlike in the first data release, the final calibrated spectra from the DR2 are not corrected for 
foreground Galactic reddening. 
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population and extinction gradients, leads to an increase of typically 0.5 in 
log M*. We did not apply any correction to the velocity dispersion estimated by 
the code given that the spectral resolution of the BC03 models and the data are 
very similar. 

We point out that we did not constrain the extinction Ay to be positive. 
There are several reasons for this choice: (a) some objects may be excessively 
dereddened by Galactic extinction; (b) some objects may indeed require bluer 
SSP spectra than those in the base; (c) the observed light may contain a scat- 
tered component, which would induce a bluening of the spectra not taken into 
account by the adopted pure extinction law; (d) constraining Ay to have only 
positive values produces an artificial concentration of solutions at Ay = 0, an 
unpleasant feature in the Ay distribution. Interestingly, most of the objects 
for which we derive negative Ay (typically —0.1 to —0.3 mag) are early-type 
galaxies. These galaxies are dominated by old populations, and expected to 
contain little dust. This is consistent with the result of K03, who find negative 
extinction primarily in galaxies with a large D n (4000). The distribution of Ay 
for these objects, which can be selected on the basis of spectral or morphological 
properties, is strongly peaked around Ay = 0, so that objects with Ay < can 
be considered as consistent with having zero extinction. In any case, none of the 
results reported in this paper is significantly affected by this choice. 

3.3. Emission line measurements 

Besides providing estimates of stellar population properties, the synthesis mod- 
els allow the measurement of emission lines from the "pure-emission" , starlight 
subtracted spectra (0\ — M\). We have measured the lines of [O n]AA3726,3729, 
[O m]A4363, H/3, [O m]AA4959,5007, [O i]A6300, [N n]A6548, Ha, [N n]A6584 
and [S n]AA6717,6731. Each line was treated as a Gaussian with three pa- 
rameters: width, offset (with respect to the rest-frame central wavelength), 
and flux. Lines from the same ion were assumed to have the same width 
and offset. We have further imposed [O m]A5007/[O m]A4959 = 2.97 and 
[N ll]A6584/[N n]A6548 = 3 flux ratio constraints. Finally, we consider a line to 
have significant emission if its fit presents a S/N ratio greater than 3. 

In some of the following analysis, galaxies with emission lines are clas- 
sified according to their position in the [O iii]/H/3 versus [N ii]/Ha diagram 
proposed by Baldwin, Phillips & Terlevich (1981) to distinguish normal star- 
forming galaxies from galaxies containing active galactic nuclei (AGN). We de- 
fine as normal star-forming galaxies those galaxies that appear in this diagram 
and are below the curve defined by K03 (see also Brinchmann etal. 2004). Ob- 
jects above this curve are transition objects and galaxies containing AGN. 

3.4. Comparisons with the MPA/JHU database 

The SDSS database has been explored by several groups, using different ap- 
proaches and techniques. The MPA/JHU group has recently publicly released 
catalogues 3 of derived physical properties for 211894 SDSS galaxies, including 
33589 narrow-line AGN (K03, see also Brinchmann etal. 2004). These cata- 



3 available at http://www.mpa-garching.mpg.de, 
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logues are based on the K03 method to infer the star formation histories, dust 
attenuation and stellar masses of galaxies from the simultaneous analysis of the 
4000 A break strength, D n (4000), and the Balmer line absorption index H^. 
These two indices are used to constrain the mean stellar ages of galaxies and the 
fractional stellar mass formed in bursts over the past few Gyr, and a comparison 
with broad-band photometry then allows to estimate the extinction and stellar 
masses. 

The MPA/JHU catalogues provide very useful benchmarks for similar stud- 
ies. In this section we summarize the comparison between values of some of the 
parameters from these catalogues with our own estimates. A more complete 
discussion is presented in Cid Fernandes et al. (2005). 

Stellar extinction The MPA/JHU group estimates the z-band stellar extinc- 
tion A z through the difference between model and measured colours, assuming 
an attenuation curve proportional to A -0 ' 7 . In our case, the extinction Ay is 
derived directly from the spectral fitting, carried out with the Milky Way extinc- 
tion law (Cardelli et al. 1989, c.f. Section \2. 1.(1 . These two independent estimates 
are very strongly and linearly correlated, with a Spearman rank correlation co- 
efficient r$ = 0.95. However, the values of A z reported by the MPA/JHU group 
are systematically larger than our values: ^4 Z (MPA/JHU) ~ 2.51v4 2 (This Work) 
in the median. This discrepancy, however, is only apparent because the Galactic 
extinction law is substantially harder than A~ ' 7 . One thus expects to need less 
extinction when modeling a given galaxy with the former law than with the lat- 
ter. Our analysis indicates that there are no substantial differences between the 
MPA/JHU and our estimates of the stellar extinction other than those implied 
by the differences in the reddening laws adopted in the two studies. 

Stellar masses Our results for the total stellar masses compares very well to the 
MPA/JHU extinction-corrected stellar masses, with rs = 0.89. The quantitative 
agreement is also good, with a median difference of just 0.1 dex. This small offset 
seems to be due to a subtle technicality. Whereas we adopt the M/L ratio of 
the best \ 2 model, the MPA/JHU group derives M/L comparing the observed 
values of the -D n (4000) and H5a indices with a library of 32000 models. Each 
model is then weighted by its likelihood, and a probability distribution for M/L 
is computed. The MPA/JHU mass is the median of this distribution, which is 
not necessarily the same as the best-x 2 value. 

Velocity dispersion We use the sub-sample of galaxies with active nuclei to 
compare our measurements of absorption line broadening due to galaxy velocity 
dispersion and/or rotation, a+, with those of the MPA/JHU group, since they 
list this quantity only in their AGN Catalogue. The Spearman correlation- 
coefficient in this case is rs = 0.91 and the median of the difference between the 
two estimates is just 9 kms -1 , indicating an excellent agreement between both 
studies. 

Emission lines and nebular metallicities Brinchmann et al. (2004) also provide, 
in their Emission line Catalogue, data on emission lines which can be compared 
with our own measurements. We have compared the fluxes and equivalent widths 
of Ha, [N n]A6584, [O n]A3727, H/3 and [O m]A5007 as measured by our code 
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and that obtained by the MPA/JHU group. We do not find any significant 
difference between these values; the largest discrepancy (~ 5 per cent) was found 
for the equivalent widths of Ha and [N Ii], probably due to different estimates 
of the continuum level and the associated underlying stellar absorption. Our 
emission line measurements are also in good agreement with those in Stasihska 
et al. (2004), who fit the Balmer lines with emission and absorption components, 
instead of subtracting a starlight model. 

Overall, we conclude that our spectral synthesis method yields estimates of 
physical parameters in good agreement with those obtained by the MPA/JHU 
group, considering the important differences in approach and underlying as- 
sumptions. 



4. Empirical relations 

Yet another way to assess the validity of physical properties derived through 
a spectral synthesis analysis is to investigate whether this method yields as- 
trophysically reasonable results. In this section we follow this empirical line of 
reasoning by comparing some results obtained from our synthesis of SDSS galax- 
ies (which excludes emission lines) with those obtained from a direct analysis of 
the emission lines. Our aim here is to demonstrate that our synthesis results do 
indeed make sense. 

4.1. Stellar and Nebular Metallicities 

Our spectral synthesis approach yields estimates of the mean metallicity of the 
stars in a galaxy, < Z± >. The analysis of emission lines, on the other hand, 
gives estimates of the present-day abundances in the warm interstellar medium. 
Although stellar and nebular metallicities are not expected to be equal, it is 
reasonable to expect that they should roughly scale with each other. 

Fig. [2]shows the correlation between mass- weighted stellar metallicities and 
the nebular oxygen abundance (computed as in Section EiAj) . both in solar units 4 , 
for our sample of normal star-forming galaxies. A correlation is clearly seen, 
although with large scatter (rg = 0.42). Galaxies with large stellar metallicities 
also have large nebular oxygen abundances; galaxies with low stellar abundances 
tend to have smaller abundances. The observed scatter is qualitatively expected 
due to variations in enrichment histories among galaxies. 

Nebular and stellar metallicities are estimated through completely different 
and independent methods, so the correlation depicted in Fig. |2] provides an a 
posteriori empirical validation for the stellar metallicity derived by the spectral 
synthesis. The possibility to estimate stellar metallicities for so many galaxies is 
one of the major virtues of spectral synthesis, as it opens an important window 
to study the chemical evolution of galaxies and of the universe as a whole (Panter 
etal. 2004; Sodre et al. in prep.). 



4 The solar unit adopted for the nebular oxygen abundance is 12 + log(0/H)© = 8.69 (Allende 
Prieto, Lambert & Apslund 2001). 
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Figure 2. Nebular oxygen abundance versus mass-weighted stellar metal- 
licity, both in solar units, for normal star-forming galaxies in our sample. The 
median values and quartiles in bins of same number of objects are shown as 
thin solid lines. The dashed line is a robust fit for the relation. 



4.2. Stellar and Nebular extinctions 

The stellar extinction in the V-band is one of the products of our STARLIGHT 
code. A more traditional and completely independent method to evaluate the 
extinction consists of comparing the observed Ha/H/3 Balmer decrement to the 
theoretical value, the "Balmer extinction" (e.g., Stasihska etal. 2004). 

Fig-Elpresents a comparison between the stellar Ay and ^4^ almer . These two 
extinctions are determined in completely independent ways, and yet, our results 
show that they are closely linked, with r$ = 0.61. A linear bisector fitting yields 
^Balmer _ q ^4 _j_ 1.81^4^. Note that the angular coefficient in this relation indi- 
cates that nebular photons are roughly twice as extincted as the starlight. This 
"differential extinction" is in very good qualitative and quantitative agreement 
with empirical studies (Fanelli et al. 1988; Calzetti, Kinney & Storchi-Bergmann 
1994; Gordon etal. 1997; Mas-Hesse & Kunth 1999). 



Spectral synthesis of SDSS galaxies 13 




1 2 




Figure 3. Relation between the nebular (Ay mel ) and stellar (Ay) extinc- 
tions for our sample of normal star- forming galaxies. The solid line is a robust 
fit for the relation. 



4.3. Relations with mean stellar age 

The equivalent width (EW) of Ha is related to the ratio of present to past star 
formation rate of a galaxy (e.g., Kennicutt 1998). It is thus expected to be 
smaller for older galaxies. Fig. 0^ shows the relation between EW(Ha) and the 
mean light-weighted stellar age obtained by our spectral synthesis. The anti- 
correlation, which has r$ = —0.78, is evident. It is worth stressing that these 
two quantities are obtained independently, since the spectral synthesis does not 
include emission lines. 

Another quantity that is considered a good age indicator, even for galaxies 
without emission lines, is the 4000 A break, D4000. We measured this index 
following Bruzual (1983), who define D4000 as the ratio between the average 
value of F v in the 4050-4250 and 3750-3950 A bands. The relation between 
< logi* >l and D4000 is shown in Fig. |Ij3. Note that the concentration of 
points at the high age end reflects the upper age limit of the base adopted here, 
13 Gyr (c.f. Section 2.2). The correlation is very strong (rs = 0.94), showing that 
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Figure 4. (a) Equivalent width of Ha versus the light-weighted mean stellar 
age for normal emission line galaxies in our sample, (b) Relation between the 
D4000 index and the light-weighted mean stellar age. The solid lines are 
robust fits for the relations. 

indeed D4000 can be used to estimate empirically mean light-weighted galaxy 
ages, despite its metallicity dependence for very old stellar populations (older 
than 1 Gyr, as shown by K03). 

4.4. relation 

Fig. [5] shows the relation between stellar mass and cr*, obtained from the syn- 
thesis. The relation is quite good, with r$ = 0.79. The solid line displayed in 
the figure is log M+ = 6.44 + 2.04 log <r* for M* in M@ and cx* in km s _1 , obtained 
with a bisector fitting. The figure also shows as a dashed line a fit assuming 
M+ oc £7~, expected from the virial theorem under the (unrealistic) assumption 
of constant mass surface density. In both cases we have excluded from the fit 
galaxies with o* < 35 kms -1 , which corresponds to less than half the spectral 
resolution of both data and models. 

This is another relation that is expected a priori if we have in mind the 
Faber- Jackson relation for ellipticals and the Tully-Fisher relation for spirals. 
For early-type galaxies, <r* is a measure of the central velocity dispersion, which 
is directly linked to the gravitational potential depth, and, through the virial the- 
orem, to galactic mass. For late-type systems, <r* has contributions of isotropic 
motions in the bulges, as well as of the rotation of the disks, and is also expected 
to relate with galactic mass. Another aspect that it is interesting to point out 
in Fig. [21 is that the dispersion in the M^-cr* relation decreases as we go from 
low-luminosity, rotation-dominated systems, for which the values of a* depend 
on galaxy inclination and bulge-to-disk ratio, to high-luminosity, mostly early- 
type systems, which obey a much more regular (and steeper) relation between 
o± and M*. 

This relation, between a quantity that is not directly linked to the synthesis, 
er*, and another one that is a product of our synthesis, M*, is yet another 
indication that the results of our STARLIGHT code do make sense. 
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Figure 5. Mass-velocity dispersion relation for our sample. The solid line 
is a robust fit for the data while dashed one is a fitting assuming M* oc a 4 . 



5. Recent results 



We have started an analysis of a magnitude-limited sample with 20000 galaxies 
extracted from DR2, aiming to probe galaxies with luminosities smaller than 
those discussed in the previous sections. We present below a summary of some 
of the results obtained so far. 



The bimodality of galaxy populations With the recent advance of redshift sur- 
veys, the study of galaxy populations has quantitatively revealed the existence 
of a bimodal distribution in some fundamental galaxy properties, found both 
in photometric and spectroscopic data: star-forming and passive galaxies have 
quite distinct properties. Perhaps the most representative bimodal distribution 
is that found in galaxy colours. Since photometric parameters are easily mea- 
sured, their bimodal behavior has been studied for galaxies in the local universe 
by using both the SDSS and the Two Degree Field Galaxy Redshift Survey 
(2dFGRS) data (Strateva et al. 2001; Hogg et al. 2002; Blanton et al. 2003, 
Wild et al. 2004). A bimodality has been found in other galaxy parameters, 
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like mass (Kauffmann et al. 2003b) and star formation properties (Madgwick 
et al. 2002; Wild et al. 2004; Brinchmann et al. 2004). We show in Fig. 6 
that the bimodal character of galaxy populations is also clearly present in some 
parameters derived from our spectral synthesis (Mateus et al., in prep.). These 
results suggest that galaxies have evolved through two major paths. 




(u-r) (u-r) (u-r) 




Figure 6. Results of the spectral synthesis of 20000 galaxies of a magnitude- 
limited sample extracted from SDSS showing properties of star-forming (SF) 
and passive (P) galaxies. The vertical lines correspond to value of the param- 
eters (colours or D4000) which provide the best separation between these two 
classes of galaxies. 



The down-sizing of galaxy populations Another interesting result emerging 
from our synthesis of SDSS galaxies is the confirmation that there is a correla- 
tion between mass and galaxy age, in the sense that most of the stars in massive 
galaxies were formed long time ago, whereas galaxies with a large fraction of 
young or intermediate-age stars tend to be less massive. This phenomenon, 
known as the down-sizing of galaxy populations (Cowie et al. 1996; Juneau et 
al. 2004; Kodama et al. 2004), is clearly seem in our results with this new 
sample and, actually, is an essential piece of information on galaxy formation 
and evolution. 

The problem of the alpha- enhancement A major problem of spectral synthesis 
with a spectral base using BC03 spectra is that while their evolutionary tracks 
are scaled from the solar values, non-solar abundance patterns may occur in 
galaxies, mainly in early- types (e.g., Trager et al. 2000a,b). In practice, as 
discussed by BC03, lines associated to a-elements are not fitted as well as other 
lines. This problem is indeed present in our results, as shown in Fig. 8 for 
a subsample of 2488 bona fide ellipticals with high S/N spectra. This figure 
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Figure 7. Relation between stellar mass and mean mass-weighted stellar 
ages for galaxies in the magnitude-limited sample of 20000 galaxies extracted 
from SDSS. 



indicates that the residuals in the absorption lines of some a-elements correlate 
with the depth of the gravitational potential well of the galaxy (Cid Fernandes et 
al., in prep.), suggesting that our spectral synthesis results of early- type galaxies 
may be biased due to inadequacy of the spectral base adopted here to synthesize 
this type of galaxy. 



6. Summary 

We have developed and tested a method to fit galaxy spectra with a combination 
of spectra of individual simple stellar populations generated with state-of-the 
art evolutionary synthesis models. The main goal of this investigation was to 
examine the reliability of physical properties derived in this way. This goal was 
pursued by three different means: simulations, comparison with independent 
studies, and analysis of empirical results. 

Our simulations show that the individual SSP strengths, encoded in the 
population vector x, are subjected to large uncertainties, but robust results can 
be obtained by compressing x into coarser but useful indices. In particular, phys- 
ically motivated indices such as mean stellar ages and metallicities are found to 
be well recovered by spectral synthesis even for relatively noisy spectra. Stel- 
lar masses, velocity dispersion and extinction are also found to be accurately 
retrieved. 

We have applied our STARLIGHT code to a volume limited sample of over 
50000 galaxies from the SDSS Data Release 2. The spectral fits are generally 
very good, and allow accurate measurements of emission lines from the starlight 
subtracted spectrum. We have compared our results to those obtained by the 
MPA/JHU group (K03; Brinchmann etal. 2004) with a different method to 
characterize the stellar populations of SDSS galaxies. The stellar extinctions 
and masses derived in these two studies are very strongly correlated. Further- 
more, differences in the values of Ay and are found to be mostly due to the 
differences in the model ingredients (extinction law). Our estimates of stellar 
velocity dispersions and emission line properties are also in good agreement with 
those of the MPA/JHU group. 

The confidence in the method is further strengthened by several empiri- 
cal correlations between synthesis results and independent quantities. We find 
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Figure 8. Residuals (measured as equivalent widths) of the spectral synthe- 
sis in the region of CN (4132-4196), Mg (5015-5190) and Na D (5874 - 5911) 
lines as a function of the galaxy velocity dispersion. The data is for a sample 
of 2488 early-type galaxies. 



strong correlations between stellar and nebular metallicites, stellar and nebu- 
lar extinctions, mean stellar age and the equivalent width of Ha, mean stellar 
age and the 4000 A break, stellar mass and velocity dispersion. These are all 
astrophysically reasonable results, which reinforce the conclusion that spectral 
synthesis is capable of producing reliable estimates of physical properties of 
galaxies. These results validate spectral synthesis as a powerful tool to study 
the history of galaxies. 

We have also presented some preliminary results of an analysis of a magni- 
tude-limited sample containing 20000 galaxies from SDSS which clearly reveal 
that the bimodality of galaxy populations is present in the parameters obtained 
from the synthesis. Our results are also consistent with the "down-sizing" sce- 
nario of galaxy formation and evolution. Finally, we point out one of the ma- 
jor problems of the spectral synthesis of early-type systems: the spectral base 
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adopted here is based on solar-scaled evolutionary tracks which may not be 
appropriate for this type of galaxy. 
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